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In this article, which is based on the first part of my PhD thesis, 1 review the statistics of the open string 
sector in T^/{Z2 x Z2) orientifold compactifications of the type llA string. After an introduction to the 
orientifold setup, 1 discuss the two different techniques that have been developed to analyse the gauge sector 
statistics, using either a saddle point approximation or a direct computer based method. The two approaches 
are explained and compared by means of eight- and six-dimensional toy models. In the four-dimensional 
case the results are presented in detail. Special emphasis is put on models containing phenomenologically 
interesting gauge groups and chiral matter, in particular those containing a standard model or SU(5) part. 
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1 Introduction 

Starting from early observations of Lerche, Liist and Schellekens f8l |, it has become clear over the years 
that string theory does provide us not only with one consistent low energy effective theory, but with a 
multitude of solutions. This phenomenon has been given the name "the landscape" ll97ll94ll (for a recent 
essay on the subject see also |54|). 

It was known from the very first approaches to compactification of string theory to four dimensions 
that there exist many families of solutions due to the so-called moduli. These scalar fields parametrise 
the geometric properties of possible compactification manifolds and their values are generically not fixed. 
It was believed for a long time that some stabilisation mechanism for these moduli would finally lead 
to only one consistent solution. Even though it is way too early to completely abandon this idea, recent 
developments suggest that even after moduli stabilisation there exists a very large number of consistent 
vacuum solutions. Especially the studies of compactifications with fluxes (see e.g. [72 1 and references 
therein) clarified the situation. The effective potential induced by these background fluxes, together with 
non-perturbative effects, allow to fix the values of some or even all of the moduli at a supersymmetric 
minimum. What is surprising is the number of possible minima, which has been estimated L22ij to be of 
the order of 10^"°. So it seems very likely that there exists a very large number of stable vacua in string 
theory that give rise to low energy theories which meet all our criteria on physical observables. 

After the initial work of Douglas |53|, who pointed out that facing these huge numbers the search for 
the vacuum is no longer feasible, recent research has started to focus on the statistical distributions of string 
vacua. This approach relies on the conjecture that, given such a huge number of possible vacua, our world 
can be realized in many different ways and only a statistical analysis might be possible. Treating physical 
theories on a statistical basis is a provocative statement and it has given rise to a sometimes very emotional 
debate. Basic criticism is expressed in |(9][l0l, where the authors emphasise the point that, as long as we do 
not have a non-perturbative description of string theory, such reasoning seems to be premature. Moreover 
such an approach immediately rises philosophical questions. How can we talk seriously about the idea to 
abandon unambiguous predictions of reality and replace it with statistical reasoning? One is reminded to 
similar questions concerning quantum mechanics, but there is a major difference to this problem. In the 
case of quantum mechanics there is a clean definition of observer and measurement. Most importantly, 
measurements can be repeated and therefore we can make sense out of a statistical statement. In the case 
of our universe we have just one measurement and there is no hope to repeat the experiment. 

At the moment there are two roads visible that might lead to a solution of these problems. One of them 
is based on anthropic arguments 19711 . which have already been used outside string theory to explain the 
observed value of the cosmological constant f 1001. Combined with the landscape picture this gives rise 
to the idea of a multiverse, where all possible solutions for a string vacuum are actually realised ifTsll (for 
a recent essay on the cosmological constant problem and the string landscape see also |91 1). Anthropic 
reasoning is not very satisfactory, especially within the framework of a theory that is believed to be unique. 
Another possible way to deal with the landscape might therefore be the assignment of an entropy to the 
different vacuum solutions and their interpretation in terms of a Hartle-Hawking wave function lf86ll24l . 
A principle of extremisation of the entropy could then be used to determine the correct vacuum. 

We do not dwell into philosophical aspects of the landscape problem in this work, but rather take a very 
pragmatic point of view, following Feynman's "shut up and calculate" attitude. In this endeavour a lot of 
work has been done to analyse the properties and define a suitable statistical measure in the closed string 
sector of the theory ||6l[33 42 66, 51 , 84, 32, 40, 44, 47, 50, 1 , 52]|55l|76l. In this work we are focusing on 
the statistics of the open string sector Inl [77l |78l [5] 113 1^ |68l |2ll |79l IS trying to take 

the most general point of view and analyse a generic statistical distribution, but focus instead on a very 
specific class of models. In this small region of the landscape we are going to compute almost all possible 
solutions and give an estimate for those solutions we were not able to take into account. 



Note that in this estimate not all effects from the process of moduli stabilisation have been taken into account. 
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There are several interesting questions one can ask, given a large set of possible models. One of them 
concerns the frequency distribution of properties, like the total rank of the gauge group or the occurrence 
of certain gauge factors. Another question concerns the correlation of observables in these models. This 
question is particularly interesting, since a non-trivial correlation of properties could lead to the exclusion 
of certain regions of the landscape or give hints where to look for realistic models. It should be stressed that 
in our analysis of realistic four-dimensional compactifications we are not dealing with an abstract statistical 
measure, but with explicit constructions. 

1.1 Outline 

This paper is based on the first part of the author's PhD thesis ll67l and is structured as follows. In section|2] 
we prepare the stage, introducing the special class of type II orientifold models that are our objects of 
interest. Moreover we explain the two methods we use to analyse these models. On the one hand, the 
saddle point approximation and on the other hand a brute force computer algorithm for explicit calculations. 
Concerning this algorithm, we comment on its computational complexity, which touches a more general 
issue about computations in the landscape. In the last section we discuss another fundamental problem 
of the statistical analysis, namely the finiteness of vacua. An analytic proof of finiteness seems to be out 
of reach, but we give several numerical arguments that support the conjecture that the total number of 
solutions is indeed finite. 

In section [3] we apply the described methods to type II orientifold models. We begin with general 
questions about the frequency distributions of properties of the gauge sector in compactifications to six and 
four dimensions. After that we select several subsets of models for a more detailed analysis. We choose 
those subsets that could provide us with a phenomenologically interesting low energy gauge group. This 
includes first of all the standard model, but in addition constructions of Pati-Salam, SU{5) and flipped 
SU (5) models. In the case of standard model-like constructions we investigate the relations and frequency 
distributions of the gauge coupling constants and compare the results with a recent analysis of Gepner 
models P8l|49l . In the last part of this section the question of correlations of gauge sector observables is 
explored. 

Finally we sum up our results and give an outlook to further directions of research in section |4] In 
appendix lAlwe summarise some useful formulae for the different orientifold models. AppendixlBlcontains 
details about the implementation of the computer algorithm, used to construct the models that have been 
analysed. 

2 Models and methods 

As explained in the introduction, our program to classify a subset of the landscape of string vacua is 
performed on a very specific set of models. In this section, we want to set the stage for the analysis, 
explain the construction and the constraints on possible solutions. Moreover, we have to develop the 
necessary tools of analysis. 

In the first part of this section we give a general introduction to the orientifolds we are planning to anal- 
yse. We focus on the consistency conditions that have to be met by any stable solution. In particular these 
are the tadpole conditions for the R-R fields, the supersymmetry conditions on the three-cycles wrapped 
by D-branes and orientifold planes and restrictions coming from the requirement of anomaly cancellation. 

In the second part we develop the tools for a statistical analysis and test them on a very simple compact- 
ification to eight dimensions. There are two methods that we use for six- and four-dimensional models in 
the next section, namely an approximative method and a direct, brute force analysis. The first method re- 
lies on the saddle point approximation, which we explain in detail and compare it with known results from 
number theory. For the second method we describe an algorithm that can be used for a large scale search 
performed on several computer clusters. To estimate the amount of time needed to generate a suitable 
amount of solutions, we analyse the computational complexity of this algorithm. 
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In the last part of this section we investigate the problem of finiteness of the number of solutions, an 
issue that is important to judge the validity of the statistical statements. 

2.1 Orientifold models 

Let us give a brief introduction to the orientifold models we use in the following to do a statistical analysis. 
We wiU not try to give a complete introduction to the subject, for readers with interest in more background 
material, we refer to the available textbooks 1 89 74] and reviews ll92l [88l l60l for a general introduction 
and the recent review [il5 j for an account of orientifold models and their phenomenological aspects. 

Our analysis is based on the study of supersymmetric toroidal type II orientifold models with inter- 
secting D-branes lfT2l l4l l8l l85l l95l . These models are, of course, far from being the most general com- 
pactifications, but they have the great advantage of being very well understood. In particular, the basic 
constraints for model building, namely the tadpole cancellation conditions, the supersymmetry and K- 
theory constraints, are well known. It is therefore possible to classify almost all possible solutions for 
these constructions. 

The orientifold models we consider can be described in type IIB string theory using space-filUng D9- 
branes with background gauge fields on their worldvolume. An equivalent description can be given in 
the T-dual type IIA picture, where the D9-branes are replaced by D6-branes, which intersect at non-trivial 
angles. This point of view is geometrically appealing and goes under the name of intersecting D6-branes. 
We use this description in the following. 

The orientifold projection is given by 1)^^, where fl : (cr, r) (— ct, r) defines the world- 

sheet parity transformation and a is an isometric anti-holomorphic involution, which we choose to be 
simply complex conjugation in local coordinates: ct : z — > z. denotes the left-moving space-time 
fermion number. This projection introduces topological defects in the geometry, the so-called orientifold 
06-planes. These are non-dynamical objects, localised at the fixed point locus of a, which caiTy tension 
and charge under the R-R seven-form, opposite to those of the D6-branefl 

Both, the 06-planes and D6-branes wrap three-cycles tt G H^{M, Z) in the internal Calabi-Yau man- 
ifold M, which, in order to preserve half of the supersymmetry, have to be special Lagrangian. Since the 
charge of the orientifolds is fixed and we are dealing with a compact manifold, the induced R-R and NS-NS 
tadpoles have to be cancelled by a choice of D6-branes. These two conditions, preserving supersymmetry 
and cancelling the tadpoles, are the basic model building constraints we have to take into account. 

The homology group H^{AI, Z) of three-cycles in the compact manifold Ad splits under the action of 
Via into an even and an odd part, such that the only non-vanishing intersections are between odd and even 
cycles. We can therefore choose a symplectic basis (a/, (3i) and expand tTq and vr^ as 



63/2 63/2 

E 

1=1 7=1 



Xiai + YlPi) , < = ^ {Xiai - Y^Pi) , (1) 



and TToe as 



-, "3/2 

7=1 

where 63 is the third Betti-Number of M, counting the number of three-cycles. 
2.1.1 Chiral matter 

Chiral matter arises at the intersection of branes wrapping different three-cycles. Generically we get bi- 
fundamental representations (Nq, N^) and (Na, N;,) of U{Na) x U (Nt,) for two stacks with Na and Nt, 



2 

It is also possible to introduce orientifold planes with different charges, but we consider only those with negative tension and 
charge in this work. 
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branes. The former arise at the intersection of brane a and brane 6, the latter at the intersection of brane a 
and the orientifold image of brane b, denoted by 5'. An example is shown in figure[T] 

Na=3 




Fig. 1 We find chiral matter at the intersection of two stacks of branes. The representation is given in terms of the 
number of branes of each stack. 



In addition we get matter transforming in symmetric or antisymmetric representations of the gauge 
group for each individual stack. The multiplicities of these representations are given by the intersection 
numbers of the three-cycles, 

63/2 

lab -.^naont^Y. i^aYb' - X^Y^) . (3) 

7=1 

The possible representations are summarized in table [T] where Sym„ and Antia are the symmetric and 
antisymmetric representations of U (Na)- 



representations 


multiplicity 


(N„,Nb) 


TTq TTb = lab 


(Na,N,) 


TT^ TTh = lab' 




I (tTo on'a-TTaO TTQe) = ^{ha' " ^Oe) 


Antia 





Table 1 Multiplicities of the chiral spectrum. 



2.1.2 Tadpole cancellation conditions 

The D6-branes in our models are charged under a R-R seven-form |87|. Since the internal manifold is 
compact, as a simple consequence of the Gauss law, all R-R charges have to add up to zero. These so- 
called tadpole cancellation conditions can be obtained considering the part of the supergravity Lagrangian 
that contains the coiTesponding contributions. In particular we do not only get contributions from k stacks 
of branes, wrapping cycles tTq, but in addition terms from the orientifold mirrors of these branes, wrapping 
cycles TT^, and the 06-planes. 



k 



5 = / dCJ/^*dCr + ^i(>y ]Na\ I C7 + / C7 I - 4/^6 / C7, (4) 



4k^ J ^ 

XxM \XX7ra Xxtt' / Xxttob 
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where the ten dimensional gravitational coupling is given by = i(27r)^(a')"', the R-R charge is denoted 
by ^6 = («')"^ (27r)^^ and X denotes the uncompactified space-time. 

From this we can derive the equations of motion for the R-R field strength Gs = dCj to be 

d*Gs = ^Ve ('^('^-) + ^(O) - 4^(^06)^ • (5) 

In this equation ^(vr) denotes the Poincare dual three form of a cycle tt. Noticing that the left hand side 
of ^ is exact, we can rewrite this as a condition in homology as 

fe 

^7V,(^„ + = 4^06 (6) 

a=l 

We do not have to worry about the NS-NS tadpoles, as long as we are considering supersymmetric 
models, since the supersymmetry conditions together with R-R tadpole cancellation ensure that there are 
no NS-NS tadpoles. In the following we consider supersymmetric models only. 



2.1.3 Supersymmetry conditions 

Since we want to analyse supersymmetric models, it is crucial that the D-branes and O-planes preserve half 
of the target-space supersymmetry. It can be shown lf83l that this requirement is equivalent to a calibration 
condition on the cycles, 

3(r!3)k„-0, 5R(r!3)k„>0, (7) 

where SI3 is the holomorphic 3-form. The second equation in (O excludes anti-branes from the spectrum. 
Written in the symplectic basis ([T]l, these equations read 

63/2 63/2 

^ Y^fj = 0, ^aui > 0, (8) 

/=! 1=1 

where we defined 

I J ai 



2.1 A Anomalies and K- theory constraints 

If the tadpole cancellation conditions ^ are satisfied, there are no cubic anomalies of SU{N) gauge 
groups in our models. What we do have to worry about are mixed anomalies, containing abelian factors. 
The mixed anomaly for branes stretching between two stacks a and 6 with Na = 1 and iVf, > 1 branes per 
stack, looks like 

Auil)^-SU(N), - Na{Iab + Iab')c2{Nb) = -27V„y,XfcC2 (TVfc) , (9) 

where C2{Nh) denotes the value of the quadratic Casimir operator for the fundamental representation of 

SU{Nb). 

The cubic anomaly consisting of three abelian factors is cancelled by the Green-Schwarz mechanism. 
This makes these C/(l)s massive and projects them out of the low energy spectrum. But in some cases, for 
example in the case of a standard model-like gauge group or for flipped SU (5) models, we want to get a 
massless U{1) factor. A sufficient condition to get such a massless U{\) in one of our models is that the 
anomaly (|9]l vanishes. 
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This can be archived, if the U{1), defined in general by a combination of several U{1) factors as 
fc 

C/(l) = ^XaC/(l)a, (10) 
a=l 

fulfills the following relations, 

fc 

J^^aNaYa^O. (11) 

a=l 

Inserting this into (|9]l shows that A vanishes. 

Besides these local gauge anomalies, there is also the potential danger of getting a global gauge anomaly, 
which would make the whole model inconsistent. This anomaly arises if a Z2 -valued K-theory charge 
is not conserved |98|. In our case this anomaly can be derived by introducing 5*^(2) probe branes on 
top of the orientifold planes and compute their intersection numbers with all branes in the model. This 
intersection number has to be even, otherwise we would get an odd number of fermions, transforming in 
the fundamental representation of Sp{2) I.101J . 



2.2 Methods of D-brane statistics 

To analyse a large class of models in the orientifold setting described in the last section, we have to develop 
some tools that allow us to generate as many solutions to the supersymmetry, tadpole and K-theory condi- 
tions as possible. It turns out that the most difficult part of this problem can be reduced to a purely number 
theoretical question, namely the problem of counting partitions of natural numbers. This insight allows 
us to use an approximative method, the saddle point approximation that we introduce in section l2".2.1l and 
apply to a simple toy-model in 12. 2. 21 Unfortunately it turns out that this method is not very well suited to 
study the most interesting compactifications, namely those down to four dimensions. Therefore we have 
to change the method of analysis in that case to a more direct one, using a brute force, exact computer 
analysis. The algorithm used to do so is described in section l?.2.3l 



2.2.1 Introduction to the saddle point approximation 

As an approximative method to analyse the gauge sector of type II orientifolds, the saddle point approx- 
imation has been introduced in ifTTl . In the following we begin with a very simple, eight-dimensional 
model, in order to explain the method. 

In the most simple case, a compactification to eight dimensions on T^, the susy conditions reduce to 
Ya = and Xa > and the tadpole cancellation conditions are given by 

fc 

Y,NaXa = 16, (12) 

a=l 

as shown in appendix lA.il 

The task to count the number of solutions to this equation for an arbitrary number of stacks fc is a 
combination of a partitioning and factorisation problem. Let us take things slowly and start with a pure 
partitioning problem, namely to count the unordered solutions of 

fc 

5]^a = i. (13) 

a=l 

This is nothing else but the number of unordered partitions of L. Since we are not interested in an exact 
solution, but rather an approximative result, suitable for a statistical analysis and further generalisation to 
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the more ambitious task of solving the tadpole equation, let us attack this by means of the saddle point 
approximation Il2l ll02l . 

As a first step to solve JTSl l. let's consider 



(14) 



fc=i 



where we do not have to worry about the ordering problem. We can rewrite this as 



all '' ^ nfc=0 '' ^ ^ ^ 

(15) 

To evaluate integrals of this type in an asymptotic expansion, the saddle point method is a commonly used 
tool. In the following we describe its application in detail. The last line of (fTSI l can be written as 

AA(L) = -^ /dgexp(/(g)), with /(g) = - log(l - g") - (L + 1) logg. (16) 

fe=l 

Now we are going to assume that the main contributions to this integral come from saddle points q^, 
determined by df /dq\^ ~ 0. In the following we work with only one saddle point at q = q^, the gen- 
eralisation to many points is always straightforward. Using the decomposition q — pex-p{i(p) we get 



— 7r 

Performing a Taylor expansion in ip 

f{po,(fi)^f{qo)+-Tj-^ ^2 + ..., 

we can compute (fTTI l to arbitrary order by inserting the corresponding terms from ( fTsT l. 
The leading order term is simply given by 

AA(")(L)=exp(/(go)), 
and the first correction at next-to-leading order by 



(17) 



(18) 



AAil(L)^i-/..exp(^-i0L.j 



(19) 



(20) 



where we defined x :— qo(p and used that {d'^ f /d(p'^)qg = —q^ {d^ f /dq^)qg. For d^f/dq^ large enough 
we finally obtain the result for the saddle point approximation including next-to-leading order corrections 

-1/2 

(21) 



AA(^)W = ^exp(/M)(0|J 



The same procedure can also be performed for functions of several variables. The integral to approxi- 
mate this situation looks like 



JV{L)^ — fUdq-eMfm, 

1=1 



(22) 
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with / being of the form 

N 



f{q)=9{q)-Y.{Li + l)logqj. (23) 



1=1 



We can perform the saddle point approximation around 'Vf{q)\^g = in the same way as above and obtain 
the following result at next-to-leading order 

AA(2)(L) = (2^)-"/2 exp(j(^-^^))(^detHess/(g)|^^J . (24) 

In the simple case discussed so far, contrary to the more complicated cases we encounter later, an 
analytic evaluation of the leading order contribution is possible. For large L the integrand of (fTSb quickly 
approaches infinity for q < 1 and 5 ~ 1. One expects a sharp minimum close to 1, which would be the 
saddle point we are looking for. 

Close to q ~ 1 we can write the first term in ( fT6] l as 



(25) 



^ 1 1 1 2 1 

^ — ' ^ — ' m I — Q — rn'' b 1 — a 

k=l k,m>0 m>0 ^ 

such that we can approximate f{q) by 

f{q)^^J—-{L + l)logq. (26) 
D 1 — q 

For large values of L, the minimum of this function is approximately at go — 1 ~ \/ ff which leads to 



/(go) ~ 7r-\/2i/3. Inserting this into (fT9l l gives a first estimate of the growth of the partitions for large L 
to be 

Af{L) ~ exp (^VWs) • (27) 

This is precisely the leading term in the Hardy-Ramanujan formula ll73l for the asymptotic growth of the 
number of partitions 

AA(L)(^^) ^ ^ exp (7rv/2l73) . (28) 

In figure|2]the results of an exact calculation, using the partition algorithm described in appendix|B] and 
the saddle point approximation in leading and next-to-leading order are shown. 



2.2.2 A first application of the saddle point approximation 

After this introduction to the saddle point method let us come back to our original problem. To solve 
equation (fT2] |. we first have to transfer our approximation method to (fTSI l and then include the factorisation 
in the computation. This last step turns out not to be too difficult, but in order to use the technique developed 
above, we have to be a bit careful about the ordering of solutions. 

In the example we presented to introduce the method, we did not have to worry about the ordering, 
since it was solved implicitly by the definition of the partition function. This is not the case for ( fTsT l. such 
that by simply copying from above the result is too large. We should divide the result by the product of the 
number of possibilities to order each partition. Obtaining this factor precisely is very difficult and since we 
are only interested in an approximative result anyway, we should try to estimate the term. Such an estimate 
can be made dividing by fc!, where k is the total number of stacks. This restricts the number of solutions 
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more than necessary, because the factor is too high for partitions that contain the same element more than 
once. Let us nevertheless calculate the result with this rough estimate and see what comes out. 

Repeating the steps from above, we can rewrite (fT3T l to obtain 



M{L) 



^ k=l 1=1 \Ni = l I J ^ \n=\ / 

^ ^/d 1 yj_ f = 1 expf^y (29) 

Applying the saddle point approximation as explained above for the function 

/('?) = 7^ - (i + 1) log g, (30) 
we get for the number of solutions of ( fT3b the estimate 

7V'(L) ~ exp(2Vl). (31) 

Comparing this result with dZTl i shows that we get the correct exponential growth, but the coefficient is too 
small by a factor 

log A/" TT 

— ^ = ^ =i 1.28. (32) 
logAA Ve 

In figure [3] we compare the results for the leading and next-to-leading order results of the computation 
above with the exact result. As already expected, the value for the second order approximation is too 
small, since our suppression factor kl is too big. Nevertheless, qualitatively the results are correct. Since 
we are not aiming at exact results, but rather at an approximative method to get an idea of the frequency 
distributions of properties of the models under consideration, this is not a big problem. 
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Ln ( N (L) ) 




Fig. 3 Comparing the results for the number of partitions of L. The solid line is the exact result, the dotted line is the 
saddle point approximation to leading order. The stars and triangles show the next-to-leading order result, without and 
including the additional analytic factor 1.28, respectively. 



Let us finally come back to the full tadpole equation (fT2] i. It can be treated in the same way as the pure 
partition problem and analogous to ( |29] l we can write 



oc L 



^ k=l 4=1 \Ni = lXi = l / 

If I °° I ( ^ \^ 

k—\ \^=1 / 

such that we obtain for / 

L X 



Close to g ~ 1 we can approximate this to 

L 

1 



/(g) ~ -1^^ 1-Llogg ~ }^-L\ogq. 



The minimum can then be found at go — 1 — which gives for the number of solutions 



(35) 



Af{L) ~ exp(2Vilogi). (36) 

The additional factor of log L in the scaling behaviour compared to (ISTT l can be explained by a result from 
number theory. It is known that the function (To(n), counting number of divisors of an integer n, has the 
property 

1 ^ 

-^(To(n)~logL+(27B-l), (37) 
where is the Euler-Mascheroni constant. 
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N(L) 

5r 




Fig. 4 Logarithmic plot of the number of solutions to the supersymmetry and tadpole equations for compactifications 
on T^. The dotted line shows the exact results, the solid line is the result of a next-to-leading order saddle point 
approximation. 



Let us compare the result ( 1361 ) with the exact number of solutions, obtained with a brute force computer 
analysis. This is shown in figureH] As expected from the discussion above, the estimate using the saddle 
point approximation is too small, but it has the correct scaling behaviour and should therefore be suitable 
to qualitatively analyse the properties of the solutions. 

We can use the saddle point approximation method introduced above to analyse several properties of 
the gauge sector of the models. To show how this works, we present two examples in the simple eight- 
dimensional case, before applying these methods in section [TTI to models on T"'/Z2. 

One interesting observable is the probability to find an SU{]\I) gauge factor in the total set of models. 
Using the same reasoning as in the computation of the number of models this is given by 

The saddle point function is therefore given by 

L X /I ML \ 

m = E ^ + log ['^''t^) - + ^) ^^'^ 

A comparison between exact computer results and the saddle point approximation to second order is shown 
in figure |5(a)| 

Another observable we are interested in is the distribution of the total rank of the gauge group in our 
models. This amounts to including a constraint 



oo 
a=l 



(40) 
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Ln [P (M) ] 




(a) (b) 



Fig. 5 Distributions for compactifications on . The solid lines are the exact result, the dotted lines represent the 
second order saddle point approximation, (a) Probability to find at least one SU (M) gauge factor, (b) Frequency 
distribution of the total rank. 



that fixes the total rank to a specific value r. This constraint can be accounted for by adding an additional 
delta-function, represented by an additional contour integral to our formula. We obtain 



OG L 



,E.iv..v..E.iv. 



with saddle point function 

L X 

f{q,z)^Y. 1 "(£+l)logg-(r + l)logz. (42) 

As we can see in figure [5(b)l where we also show the exact computer result, we get a Gaussian distribution. 

2.2.3 Exact computations 

Instead of using an approximative method, it is also possible to directly calculate possible solutions to 
the constraining equations. At least for models on or T'^ /X2, this is much more time-consuming than 
the saddle point approximation, and, what is even more important, cannot be done completely for models 
on /'L2 X Z2. The reason why a complete classification is not possible has to do with the fact that the 
problem to find solutions to the supersymmetry and tadpole equations belongs to the class of NP-complete 
problems, an issue that we elaborate on in section l23] Despite these difficulties, it turns out to be necessary 
to use an explicit calculation for four-dimensional compactifications, the ones we are most interested in, 
since the saddle point method does not lead to reliable results in that case. 

In the eight-dimensional case the algorithmic solution to the tadpole equation 

Y^NaXa^ L, (43) 



can be formulated as a two-step algorithm. First calculate all possible unordered partitions of L, then 
find all possible factorisations to obtain solutions for X and N. The task of partitioning is solved by the 
algorithm explained in appendix|B] the factorisation can only be handled by brute force. In this way we 
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are not able to calculate solutions up to very high values for L, but for our purposes, namely to check the 
validity of the saddle point approximation (see section l2".2.2l l. the method is sufficient. 

In the case of compactifications to six dimensions we can still use the same method, although we now 
have to take care of two additional constraints. First of all we exclude multiple wrapping, which gives an 
additional constraint on the wrapping numbers Xi , X2 , Yi and Y2 , defined in appendix lA.3l This constraint 
can be formulated in terms of the greatest common divisors of the wrapping numbers - we will come back 
to this issue in section [TT] Another difference compared to the eight dimensional case is that we have to 
take different values for the complex structure parameters Ui and U2 (see appendix lA. 31 for a definition) 
into account. As it is shown in section 12.31 these are bounded from above and we have to sum over all 
possible values, making sure that we are not double counting solutions with wrapping numbers which 
allow for different values of the complex structures. 

In ( |99l ) the wrapping numbers X and Y are defined as integer valued quantities in order to implement 
the supersymmetry (1103b and tadpole (1102b conditions in a fast computer algorithm. From the equations 
we can derive the following inequalities 

3 3 

< ^X^L// < ^L^t//. (44) 

7=0 7=0 

The algorithm to find solutions to these equations and the additional K-theory constraints (1105b consists 
of four steps. 

1. First we choose a set of complex structure variables [//. This is done systematically and leads to a 
loop over all possible values. Furthermore, we have to check for redundancies, which might exist 
because of trivial symmetries under the exchange of two of the three two-tori. 

2. In a second step we determine all possible values for the wrapping numbers X^ and Y^ , using (l44l i 
for the given set of complex structures, thereby obtaining all possible supersymmetric branes. In this 
step we also take care of the multiple wrapping constraint, which can be formulated, analogously to 
the six dimensional case, in terms of the greatest common divisors of the wrapping numbers. 

3. In the third and most time-consuming part, we use the tadpole equations (1102b . which after a summa- 
tion can be written as 

k 3 

J2Sa^A with 5a :=^^a(7/X„^ and A-.^J^L'Ui. (45) 

a=0 7=0 I 

To solve this equation, we note that all Sa and A are positive definite integers, which allows us to use 
the partition algorithm to obtain all possible combinations. The algorithm is improved by using only 
those values for the elements of the partition which are in the list of values we computed in the second 
step. For a detailed description of the explicit algorithm we used, see appendix IbI Having obtained 
the possible Sa, we have to factorise them into values for Na and X^. 

4. Since i45[ is only a necessary but no sufficient condition, we have to check in the fourth and last step, 
if the obtained results indeed satisfy all constraints, especially the individual tadpole cancellation 
conditions and the restrictions from K-theory, which up to this point have not been accounted for at 
all. 

The described algorithm has been implemented in C and was put on several high-performance computer 
clusters, using a total CPU-time of about 4 x 10'^ hours. The solutions obtained in this way have been saved 
in a database for later analysis. 

The main problem of the algorithm described in the last section lies in the fact that its complexity scales 
exponentially with the complex structure parameters. Therefore we are not able to compute up to arbitrarily 
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high values for the Uj. Although we tried our best, it may of course be possible to improve the algorithm 
in many ways, but unfortunately the exponential behaviour cannot be cured unless we might have access 
to a quantum computer. This is due to the fact that the number of possib le solutions to the Diophantine 
equations we are considering grows exponentially with U. In fact, this is quite a severe issue since the 
Diophantine structure of the tadpole equations encountered here is not at all exceptional, but very generic 
for the topological constraints also in other types of string constructions. The problem seems indeed to 
appear generically in computations of landscape statistics, see [41 1 for a general account on this issue. 

As we outlined in the previous section, the computational effort to generate the solutions to be anal- 
ysed in the next section took a significant amount of time, although we used several high-end computer 
clusters. To estimate how many models could be computed in principle, using a computer grid equipped 
with contemporary technology in a reasonable amount of time, the exponential behaviour of the problem 
has to be taken into account. Let us be optimistic and imagine that we would have a total number of 10^ 
processors at our disposal which are twice as fast as the ones we have been using. Expanding our analysis 
to cover a range of complex structures which is twice as large as the one we considered would, in a very 
rough estimate, still take us of the order of 500 years. 

Note that in principle there can be a big difference in the estimated computing time for the two com- 
putational problems of finding all string vacua in a certain class on the one hand, and of looking for 
configurations with special properties, that lead to additional constraints, on the other hand. As we explore 
in section 13.5.31 the computing time can be significantly reduced if we restrict ourselves to a maximum 
number of stacks in the hidden sector and take only configurations of a specific visible sector into account 
(in the example we consider we look for grand unified models with an SU{5) gauge group). Nevertheless, 
although a much larger range of complex structures can be covered, the scaling of the algorithm remains 
unchanged. This means in particular that a cutoff on the Ui, even though it might be at higher values, has 
to be imposed. 

2.3 Finiteness of solutions 

It is an important question whether or not the number of solutions is infinite. Making statistical statements 
about an infinite set of models is much more difficult than to deal with a finite sample, because we would 
have to rely on properties that reoccur at certain intervals, in order to be able to make any valuable state- 
ments at all. If instead the number of solutions is finite, and we can be sure that the solutions we found 
form a representative sample, it is possible to draw conclusions by analysing the frequency distributions of 
properties without worrying about their pattern of occurrence within the space of solutions. 

In the case of compactifications to eight dimensions, the results are clearly finite, as can be seen directly 
from the fact that the variables X and N have to be positive and L has a fixed value. Note however, although 
such an eight-dimensional model is clearly not realistic, that the complex structures are unconstrained. This 
means that if we do not invoke additional methods to fix their values, each solution to the tadpole equation 
represents in fact an infinite family of solutions. 

2.3.1 The six-dimensional c ase 

In the six dimensional case, the finiteness of the number of solutions is not so obvious, but it can be 
rigorously proven. In order to do so, we have to show that possible values for the complex structure 
parameters Ui and U2 are bounded from above. If this were not be the case, we could immediately deduce 
from equations ( 1103b that infinitely many brane configurations would be possible. 

In contrast to the eight-dimensional toy-model that we explored in section l2.2.2l in this case, and also 
for the four-dimensional compactifications, we do not want to allow branes that wrap the torus several 
times. To exclude this, we can derive the following condition on the wrapping numbers (for details see 
appendix lA.2.1l l. 



gcd{X\Y^)gcd{X\Y^) = Y\ 



(46) 
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This condition implies that all X and Y are non-vanishing. Additional branes, which wrap the same cycles 
as the orientifold planes, are given hy X £ {(1, 0), (0, 1)}, with F = in both cases. 

From ( I103I I we conclude that all non-trivial solutions have to obey U1/U2 E Q. Therefore we can 
restrict ourselves to coprime values 



(ui,U2) with Ui 



gcd(C/i,C/2) 



(47) 



With these variables we find from the supersymmetry conditions that = U2 a, for some a £ "L. Now 
we can use the relation (l90t to get 



X^ X"^ = U1U2 



(48) 



In total we get two classes of possible branes, those where X^ and X^ are both positive and those where 
one of them is 0. The latter are those where the branes lie on top of the orientifold planes. 

For fixed values of ui and U2 the tadpole cancellation conditions ( |92l ) admit only a finite set of solutions. 
Since all quantities in these equations are positive, we can furthermore deduce from ( l48T l that solutions 
which contain at least one brane with X^, X^ > are only possible if the complex structures satisfy the 
bound 



U1U2 < L1L2. 

In figure|6]we show the allowed values for m and U2 that satisfy equation ( |49] l. 
u2 
60 I 



(49) 
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Fig. 6 Allowed values for the complex structure parameters ui and it2 for compactifications to six dimensions. 



In the case that only branes with one of the X^ vanishing are present in our model, the complex struc- 
tures are not bounded from above, but since there exist only two such branes in the case of coprime wrap- 
ping numbers, all solutions of this type are already contained in the set of solutions which satisfy (|49T l. 
Therefore we can conclude that the overall number of solutions to the constraining equations in the case of 
compactifications to six dimensions is finit^ 



Note however, that in the case where ah branes he on top of the orientifold planes, we are in an analogous situation for the 
eight-dimensional compactifications. Unless we invoke additional methods of moduli stabilisation, the complex structure moduli 
represent fiat directions and we get infinite families of solutions. 
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2.3.2 Compactifications to four dimensions 

The four dimensional case is very similar to the six-dimensional one discussed above, but some new phe- 
nomena appear. In particular, we see that the wrapping numbers can have negative values, which is the 
crucial point that prevents us from proving the finiteness of solutions. Although we were not able to obtain 
an analytic proof, we present some arguments and numerical results, which provide evidence and make it 
very plausible that the number of solutions is indeed finit^ 

As in the T^/Zz case, we can derive a condition on the (rescaled) wrapping numbers X and Y, defined 
by ( 1109b . to exclude multiple wrapping. The derivation is given in appendix lA.3. H and the result i^ 

3 

Y[gcd{Y',X') = (50) 

1=1 

From the relations (1 100b . it follows that either one, two or all four X^ can be non-vanishing. The case 
with only one of them vanishing is excluded. Let us consider the three possibilities in turn and see what 
we can say about the number of possible solutions in each case. 

1. In the case that only one of the X^ ^ 0, the corresponding brane Ues on top of one of the orientifold 
planes on all three T^. This situation is equivalent to the eight-dimensional case and can be included 
in the discussion of the next possibility. 

2. If two X^ 7^ 0, we are in the situation discussed for the compactification to six dimensions. The two 
X^ have to be positive by means of the supersymmetry condition and one of the complex structures 
is fixed at a rational number Together with the eight-dimensional branes, the same proof of finiteness 
we have given for the T"'/Z2-case can be applied. 

3. A new situation arises for those branes where all X^ ^ 0. Let us discuss this a bit more in detail. 

From the relations ( 1 100b we deduce that an odd number of them has to be negative. In the case that three 
would be negative and one positive - let us without loss of generality choose Xq > - we can write the 
supersymmetry condition (1103b as 



which implies X* Ui < — X° UoViE {1, 2, 3}. This contradicts the second supersymmetry condition, 

Therefore, we conclude that the only remaining possibility is to have one of the X^ < 0. Again we choose 
X'^ without loss of generality. We can now use (ISTb to express X'^ in terms of the other three wrapping 
numbers as 

Furthermore, we can use the inequality (|44] | and derive an upper bound 

3 3 

XI C// > X" J7o + ^ X' Ui > X^ Uj > 0, V j e {1, 2, 3}. (54) 

7=0 i=l 



^ An analytic proof of this statement lias recently been found 1561 . 

^ We have to use rescaled wrapping numbers, as defined by )109t . to write the solution in this simple form. 
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As in the six-dimensional case, we can use the argument that the complex structures are fixed at rational 
values, as long as we take a sufficient number of branes. So we can write them, in analogy to ( l47T i as 
u/ 2/w/,i. Using this definition, we can write ( |54] | as 



1<X,< 



Hp=0 UP,2UQ^lUR^iUS,lLp 



(55) 



for P ^ Q R S ^ P andi ^ J K ^ L ^ i. 

From this we conclude that as long as the complex structures are fixed, we have only a finite number 
of possible brane configurations, i.e. only a finite number of solutions. This is unfortunately not enough 
to conclude that we have only a finite number of solutions in general. We would have to show, as in the 
six-dimensional case, that there exists an upper bound on the complex structures. Since we were not able 
to find an analytic proof that such a bound exists, we have to rely on some numerical hints that it is in fact 
the case. We present some of these hints in the following. 
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Fig. 7 The number of unique solutions for compactifications on T^/Z2 x Z2, taking Lj — 2 \/I € {0, . . . , 3}. The 
horizontal axis shows combinations of the Ui, ordered by their absolute value \U\. For each of these values we plotted 
the cumulative set of solutions obtained up to this point. 



Figure [7] shows how the total number of mutually different brane configurations for L = 2 increases 
and saturates, as we include more and more combinations of values for the complex structures Uj into the 
set for which we construct solutions. For this small value of L our algorithm actually admits pushing the 
computations up to those complex structures where obviously no additional brane solutions exist. 

For the physically relevant case of L = 8 the total number of models compared to the absolute value | U \ 
of the complex structure variables scales as displayed in figure[8] The plot shows all complex structures we 
have actually been able to analyse systematically. We find that the number of solutions falls logarithmically 
for increasing values of |C/|. In order to interpret this result, we observe that the complex structure moduli 
Ui are only defined up to an overall rescaling by the volume modulus of the compact space. We have 
chosen all radii and thereby also all Uj to be integer valued, which means that large \U\ correspond to 
large coprime values of R\ and i?2 ■ This comprises on the one hand decompactification limits which 
have to be discarded in any case for phenomenological reasons, but on the other hand also tori which are 
slightly distorted, e.g. almost square tori with iJj'V-^^i*^ = 0.99. 

3 Statistical analysis of orientifold models 

After preparing the stage in the last section, introducing the models and methods of analysis, we are now 
going to analyse some specific constructions of phenomenological interest. At the end of this section we 
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Fig. 8 Logarithmic plot of the absolute number of solutions for compactifications on T /Z2 x Z2 using the physical 
values I// = 8 V/ G {0, . • . , 3} against the absolute value \U\. The cutoff is set at |(7j = 12. In this plot, as in all 
other plots of this paper, we use a decadic logarithm. 



want to arrive at a point where we can make some meaningful statistical statements about the probability 
to find realisations of the standard model or GUT models in the specific set of models we are considering. 

However, it is important to mention, that our results cannot be regarded to be complete. First of all 
we neglect the impact of fluxes, which does not change the distributions completely, but definitely has 
some influence. Secondly, we are considering only very specific geometries. Since the construction of 
the orientifolds, especially the choice of the orbifold group which in our case is always Z2, has a strong 
impact on the constraining equations, it is very probable that the results change significantly once we use 
a different compactification space. Nevertheless we think that these results are one step towards a deeper 
understanding of open string statistics. 

In the first part of this section we discuss some general aspects of compactifications to six and four 
dimensions. We analyse the properties of the gauge groups, including the occurrence of specific individual 
gauge factors and the total rank. With respect to the chiral matter content, we establish the notion of a 
mean chirality and discuss their frequency distribution. 

In a second part we perform a search for models with the properties of a supersymmetric standard 
model. Besides the frequency distributions in the gauge sector we analyse the values of the gauge couplings 
and compare our results to those of a recent statistical analysis of Gepner models ||48l |49l . In addition 
to standard model gauge groups we look also for models with a Pati-Salam, SU{5) and flipped SU{5) 
structure. 

In the last part we consider different aspects of the question of correlations of observables in the gauge 
sector and give an estimate how likely it is to find a three generation standard model in our setup. 



3.1 Statistics of six-dimensional models 

Before considering the statistics of realistic four-dimensional models, let us start with a simpler construc- 
tion to test the methods of analysis developed in section|2] We will use a compactification to six dimensions 
on a T^/Z2 orientifold, defined in appendix lA.2l The important question about the finiteness of solutions 
has been settled in section |231 so we can be confident that the results we obtain will be meaningful. To 
use the saddle point approximation in this context, we have to generalise from the eight-dimensional ex- 
ample in l2.2.2l to an approximation in several variables, as described by equations (|22] | and ( |23] ). In our 
case we will have to deal with two variables q ^ (qi, (72), corresponding to the two wrapping numbers 
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Fig. 9 Logarithimic plot of the number of solutions for compactifications on r*/Z2 for — 8 and different values of 
L = . The complex structures are fixed to ui = U2 — 1. The dotted line shows the result with multiple wrapping, 
the stared line gives the result with coprime wrapping numbers. 



Let us briefly consider the question of multiple wrapping. As shown in appendix lA.2.11 we can derive a 
constraint on the wrapping numbers X and Y, such that multiply wrapping branes are excluded. To figure 
out what impact this additional constraint has on the distributions, let us compare the number of solutions 
for different values of and L^, with and without multiple wrapping. The result is shown in figure|9] As 
could have been expected, the number of solutions with coprime wrapping numbers grows less fast then 
the one where multiple wrapping is allowed. 



3.1.1 Distributions of gauge group observables 



Using the saddle point method, introduced in section l2".2.1l we can evaluate the distributions for individual 
gauge group factors and total rank of the gauge group in analogy to the simple eight-dimensional example 
we pursued in section 12.2.21 Therefore we will fix the orientifold charges to their physical values, L — 
{Li, L2) — (8, 8). The probability to find one U (M) gauge factor can be written similar to (|38] | as 



P{M,L) 



1 



AA(L)(27ri)2 



dq exp 



-(ii + l)log9i - (L2 + l)log92 



log E ^r^gf^^ 



\xeSu 



(56) 



where we denoted with Su the set of all values for X that are compatible with the supersymmetry condi- 
tions and the constraints on multiple wrapping. The number of solution M{L) is given by 



U{L) 



dq exp 



E 



^1 X2 

, '^^ X, - (ii + l)loggi - (i2 + l)logg2 
1 - 9i 92 



(57) 



The resulting distribution for the probability of an U{M) factor, compared to the results of an exact 
computer search, is shown in figure [TO(a)| 
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Fig. 10 Distributions in the gauge sector of a compactification on r''/Z2. The complex structures are fixed to 
ui — U2 — 1. The dotted line is the result of an exact computation, the solid line shows the saddle point approximation 
to second order, (a) Probability to find an {U (M) gauge factor, (b) Distribution of the total rank of the gauge group. 



As in the eight-dimensional example we can evaluate the distribution of the total rank ( |40] |. As a 
generalisation of (HTt we obtain the following formula 



P(r,L) 



AA(i)(27ri)3 / 



dqdz exp 



^1 



^1 JCo 



-{Li + 1) loggi - (^2 + 1) log 92 - (r+l) logz 



(58) 



Figure |10(b)| shows the resulting distribution of the total rank, compared to the exact result. As one can 
see, the results of the saddle point analysis are much smoother then the exact results, which show a more 
jumping behaviour, resulting from number theoretical effects. These are strong at low L, which is also the 
reason that our saddle point approximation is not very accurate. In the present six-dimensional case the 
deviations are not too strong, but in the four-dimensional case their impact is so big that the result cannot 
be trusted anymore. These problems can be traced back to the small values of L we are working with, but 
since these are the physical values for the orientifold charge, we cannot do much about it. 



3.1.2 Chirality 

Since we are ultimately interested in calculating distributions for models with gauge groups and matter 
content close to the standard model, it would be interesting to have a measure for the mean chirality of the 
matter content in our models. 

A good quantity to consider for this purpose would be the distribution of intersection numbers lab 
between different stacks of branes. This is precisely the quantity we choose later in the four-dimensional 
compactifications. In the present case we use a simpler definition for chirality, given by 

X X'X^- (59) 
This quantity counts the net number of chiral fermions in the antisymmetric and symmetric representations. 
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Using the saddle point method, we can compute the distribution of values for x, using 

1 




dq exp 



'-xeSu 



-.0.1 



\X<£Su 



where Su,x ^ Sjj is the set of wrapping numbers that fulfills 



X. I - (^1 + 1) log gi - (i2 + l)logg2 
-9i 92 
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Fig. 11 Distribution of the mean chirality for r''/Z2, = = 8, mi = 1*2 = 1 



chi'^ (1/2) 



(60) 



The resulting distribution is shown in figure [TT] For the used values of ui,U2 — 1, X has to be a 
square, which can be directly deduced from the supersymmetry conditions (|93] |. The scaling turns out to be 
roughly P{x) — exp(— cy^). From this result we can conclude that non-chiral models are exponentially 
more frequent than chiral ones. This turns out to be a general property of the orientifold models that also 
holds in the four-dimensional case. 



3.1.3 Correlations 



In this section we would like to address the question of correlations between observables for the first time. 
We come back to this issue in section 13.61 The existence of such correlations can be seen in figure [12] 
where we plotted the distributions of models with specific total rank and chirality. The connection between 
both variables is given by the tadpole cancellation conditions, which involve the Na used for the definition 
of the total rank in (l40l i and the wrapping numbers Xa, which appear in the definition of the mean chirality 
X in ( l59] l. The distribution can be obtained from 



P{x,r,L) 




-(r+l) logz 



dqdz exp 



Xi Xo 

zQi 92 



^1 

xeSu 



Xi X2 

zQi 92 
-^9^92''' 



log E - 



KXeSv 



X] Xo 

zQi 92 



X, X, I - (ii + l)loggi - (L2 + l)logg2 
zQi '92 



(61) 



which is a straightforward combination of dSSl l and ( |60] |. 
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Fig. 12 Correlation between total rank and chirality for Li = L2 = 8 and ui — U2 = 1 for a compactification on 
T'*/Z2. (b) shows the maximum of the total rank distribution depending on x- 



In figure |12(a)| one can see that the maximum of the rank distribution is shifted to smaller values for 
larger values of x- This could have been expected, since larger values of x imply larger values for the 
wrapping numbers X, which in turn require lower values for the number of branes per stack Na, in order 
to fulfill the tadpole conditions. The shift of the maximum depending of x, can be seen more directly in 
figure[T2(b)l 

3.2 Statistics of four-dimensional models 

Having tried our methods in compactifications down to six dimensions, let us now switch to the phe- 
nomenologically more interesting case of four-dimensional models. Unfortunately we can no longer use 
the saddle point approximation, since it turns out that in this more complicated case the approximation is 
no longer reliable. The results deviate significantly from what we see in exact computations. Furthermore 
the computer power needed to obtain the integrals numerically in the approximation becomes comparable 
to the effort needed to compute the solutions explicitly. 

3.2. 1 Properties of the gauge sector 

Using several computer clusters and the specifically adapted algorithm described in section 12.2.31 for a 
period of several months, we produced explicit constructions of w 1.6 x 10'* consistent compactifications 
on T^/Z2xZ2. The results presented in the following have been published in |69, 68 1, see also the analysis 
in ll78l and more recent results using brane recombination methods in 1 79 1 . 

Using this data we can proceed to analyse the observables of these models. The distribution of the 
total rank r of the gauge group is shown in figure |13(a)| An interesting phenomenon is the suppression 
of odd values for the total rank. This can be explained by the K-theory constraints and the observation 
that the generic value for is or 1 . Branes with these values belong to the first class of branes in 
the classification of section 12.3.21 and are those which lie on top of the orientifold planes. Therefore 
equation ( llOSI l suppresses solutions with an odd value for r. This suppression from the K-theory constraints 
is quite strong, the total number of solutions is reduced by a factor of six compared to the situation where 
these constraints are not enforced. 

Another quantity of interest is the distribution of U{M) gauge groups, shown in figure [T3(b)| We find 
that most models carry at least one U{\) gauge group, corresponding to a single brane, and stacks with a 
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(a) (b) 
Fig. 13 Frequency distributions of total rank and U{M) gauge groups of all models. 



higher number of branes become more and more unlikely. This could have been expected because small 
numbers occur with a much higher frequency in the partition and factorisation of natural numbers. 

3.2.2 Chirahty 

As in the six-dimensional case we want to define a quantity that counts chiral matter in the models under 
consideration. In contrast to the very rough estimate we used in section 13.1.21 this time we are going to 
count all chiral matter states, such that our definition of mean chirality is now 

2 4 ^ ^ 

k(k + l) ^ ^°'^-^°'-= fc(fc + ll ^ ^^^^ 

^ ' a,b=0,a<b ^ ' a,b=0,a<b 

In this formula the states from the intersection of two branes a and b are counted with a positive sign, 
while the states from the intersection of the orientifold image of brane a, denoted by a', and brane b are 
counted negatively. As we explained in section 12.1.11 and summarised in table [T] lab gives the number 
of bifundamental representations (Na, N^), while la'b counts (Nq, N;,). Therefore we compute the net 
number of chiral representations with this definition of x- By summing over all possible intersections and 
normalising the result we obtain a quantity that is independent of the number of stacks and can be used for 
a statistical analysis. 

A computation of the value of x according to (|62] | for all models leads to a frequency distribution of 
the mean chirality as shown in figure [14] This distribution is basically identical to the one we obtained in 
section [3".1.2l shown in figure[TT] In particular we also find that models with a mean chirality of dominate 
the spectrum and are exponentially more frequent then chiral ones. 

From the similarity with the distribution of models on T'^/Z-z we can also conjecture that there is 
a correlation between the mean chirality and the total rank, as we found it to be the case for the six- 
dimensional models in section !?. 1.31 Let us postpone this question to section [3761 where we give a more 
detailed account of several questions concerning the correlation of observables. 



3.3 Standard model constructions 

An important subset of the models considered in the previous section are of course those which could 
provide a standard model gauge group at low energies. More precisely, since we are dealing with super- 
symmetric models only, we are looking for models which might resemble the particle spectrum of the 
MSSM. 
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Fig. 14 Distribution of the mean cliirality x in compactification to four dimensions. 



To realise the gauge group of the standard model we need generically four stacks of branes (denoted by 
a,b,c,d) with two possible choices for the gauge groups: 



C/(3)„ X U{i)b X U{l)c X or {/(3)a x 5p(2)b x {/(I), x U{l)d. 



(63) 



To exclude exotic chiral matter from the first two factors we have to impose the constraint that ^Sym^j^f, = 
0, i.e. the number of symmetric representations of stacks a and b has to be zero. Models with only 
three stacks of branes can also be realised, but they suffer generically from having non-standard Yukawa 
couplings. Since we are not treating our models in so much detail and are more interested in their generic 
distributions, we include these three-stack constructions in our analysis. 

Another important ingredient for standard model-like configurations is the existence of a massless 
U{1)y hypercharge. This is in general a combination 



U{1) 



(64) 



including contributions of several U (l)s. Since we would like to construct the matter content of the stan- 
dard model, we are very constrained about the combination of U(l) factors. In order to obtain the right 
hypercharges for the standard model particles, there are three different combinations of the C/(l)s used to 
construct the quarks and leptons possible. 



1 



1, 



1. 



C/(l)i3' = l^U{l)a + -U{l\ + -U{l)d, 



C/(l)l?' 



6 

-\u{l)a-\u{l),, 



(65) 



where choices 2 and 3 are only available for the first choice of gauge groups. As explained in section i?. 1 .41 
we can construct a massless combination of U{1) factors, if (fTTI) is satisfied. This gives an additional 
constraint on the wrapping numbers Y . 

For the different possibilities to construct the hypercharge this constraint looks different. In the case of 
U{\)y^ the condition can be formulated as 



Ya + Y, + Yd = 0. 



(66) 
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Fig. 15 Assignment of brane intersections and chiral matter content for the first of the possible realisations of the 
standard model using intersecting branes. 



For Qy\ where the right-handed up-type quarks are realised as antisymmetric representations of U{3) 13] 
[TSi . we obtain 

Ya + Yb = 0, (67) 

and for Qy \ where we also need antisymmetric representations of f/(3) to realise the right-handed up- 
quarks, we get 

Ya + Yb-Yd^ 0. (68) 

In total we have found four ways to realise the standard model with massless hypercharge, summarised 
with the explicit realisation of the fundamental particles in tables|2]and[3] The chiral matter content arises 
at the intersection of the four stacks of branes. This is shown schematically for one of the four possibilities 
in figure [TSl 



3.3.1 Number of generations 

The first question one would like to ask, after having defined what a "standard model" is in our setup, 
concerns the frequency of such configurations in the space of all solutions. Put differently: How many 
standard models with three generations of quarks and leptons do we find? The answer to this question is 
zero, even if we relax our constraints and allow for a massive hypercharge (which is rather fishy from a 
phenomenological point of view). The result of the analysis can be seen in figure [T6l 

To analyse this result more closely, we relaxed our constraints further and allowed for different numbers 
of generations for the quark and lepton sector. This is of course phenomenologically no longer relevant, 
but it helps to understand the structure of the solutions. The three-dimensional plot of this analysis is 
shown in figure[T7] Actually there exist solutions with three generations of either quarks or leptons, where 
models with only one generation of quarks clearly dominate. The suppression of three generation models 
can therefore be pinned down to the construction of models with three generations of quarks, which arise 
at the intersection of the C/(3) with the SU{2)/ Sp{2) branes and the U{1) branes respectively, models 
with three generations of either quarks or leptons are shown in table |4] 

This result is rather strange, since we know that models with three families of quarks and leptons have 
been constructed in our setup (e.g. in Il20ll25l [8211341 ). A detailed analysis of the models in the literature 
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U{3)a X Sp{2)b X U{1), X Uil)d with q'^^ 
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(3,2)0.0 
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(3,l)_i,o + (3,l)o,-i 


la'c la'd 


dR 
dR 


(3, 1)1.0 + (3, l)oa 
(3a, 1)0,0 


la'c' + la'd' 
^{laa' + laOe) 


L 


(l,2)_i.o + (1,2)0,-1 


he + hd 


SR 
GR 
SR 


(1,1)2,0 
(1,1)0,2 
(1,1)1,1 


\{Icc' — IcOe) 

\{Idd' — IdO&) 
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U{3)a X C/(2)b X U{l)c X U{l)d with q(,'^ 
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L 
L 


(l,2)_i.o + (1,2)0,-1 
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he + hd 
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SR 
SR 
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(1,1)2,0 
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\{Icc' ~ leOe) 

\{hd' — Idoe) 

led' 



Table 2 Realisation of quarks and leptons for the two different choices of gauge groups J63t and hypercharge (1) in 
(|65). 



shows that all models which are known use (in our conventions) large values for the complex structure 
variables Uj and therefore did not appear in our analysis (see section |2. 2. 3l l. On the other hand we know 
that the number of models decreases exponentially with higher values for the complex structures. Therefore 
we conclude that standard models with three generations are highly suppressed in this specific setup. 

This brings up a natural question, namely: How big is this suppression factor? We postpone this question 
to section U.6. 21 where we analyse this issue more closely and finally give an estimate for the probability 
to find a three generation standard model in our setup. For now let us just notice that this probabiUty has 
to be smaller than the inverse of the total number of models we analysed, i.e. < 10^*. 

3.3.2 Hidden sector 

Besides the so called "visible sector" of the model, containing the standard model gauge group and parti- 
cles, we have generically additional chiral matter, transforming under different gauge groups. This sector 
is usually called the "hidden sector" of the theory, assuming that the masses of the additional particles are 
lifted and therefore unobservable at low energies. 
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Table 3 Realisation of quarks and leptons for hypercharges (2) and (3) of ( 165 1 ) . which can only be realised for the first 
choice of gauge groups in i63t . 
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Fig. 16 Number of quark and lepton generations with (red bars on the left) and without (blue bars on the right) 
enforcing amassless ?7(1). 



In figure 18(i 
sector. In 1 18(b) 



a)| we show the frequency distributions of the total rank of gauge groups in the hidden 
we show the frequency distribution of individual gauge group factors. Comparing these 
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# of quark gen. 


# of lepton gen. 


# of models 


1 
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183081 
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4 


136 


4 


3 


48 



Table 4 Number of models found with either three quark or three lepton generations. 



Log(# models! 




Fig. 17 Logarithmic plot of the number of models with different numbers of generations of quarks and leptons. Ql 
denotes the number of quark families, L is the number of lepton generations. 



results with the distributions of the full set of models in figure[T3] we observe that at a qualitative level the 
restriction to the standard model gauge group in the visible sector did not change the distribution of gauge 
group observables. The number of constructions in the standard model case is of course much lower, but 
the frequency distributions of the hidden sector properties behave pretty much like those we obtained for 
the complete set of models. 

As we argue in section 13.61 this is not a coincidence, but a generic feature of the class of models we 
analysed. Many of the properties of our models can be regarded to be independent of each other, which 
means that the statistical analysis of the hidden sector of any model with specific visible gauge group leads 
to very similar results. 

3.3.3 Gauge couplings 

The gauge sector considered so far belongs to the topological sector of the theory, in the sense that its 
observables are defined by the wrapping numbers of the branes and independent of the geometric mod- 
uli. This does not apply to the gauge couplings, which explicitly do depend on the complex structures. 
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Fig. 18 Frequency distributions of (a) total rank and (b) single gauge group factors in the hidden sector of MSSM- 
models (red bars on the left) and MSSM models with massive (7(1) (blue bars on the right). 



following the derivation in |T9l, which in our conventions reads 

, ^ Tx'Uj, (69) 

aa 2V2MsKa /rr3 pWpW 



where = 1 or 2 for an U{N) or Sp{2N) stack respectively. 

If one wants to perform an honest analysis of the coupling constants, one would have to compute their 
values at low energies using the renormalization group equations. We are not going to do this, but look 
instead at the distribution of a^/a^ at the string scale. A value of one at the string scale does of course not 
necessarily mean unification at lower energies, but it could be taken as a hint in this direction. 

To calculate the coupling ay we have to include contributions from all branes used for the definition of 
U{1)y- Therefore we need to distinguish the different possible constructions defined in ( l65T l. In general 
we have 

k 

— =V2iV,a;2^, (70) 
ay ^ aa 

a—l 

which for the three different possibiUties reads explicitly 
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ay' 
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H h 

at 



2—. (71) 

ad 

The result is shown in figure [T9(a)| and it turns out that only 2.75% of all models actually do show gauge 
unification at the string scale. 

Furthermore we analyse the distribution of values for the Weinberg angle 

sin^^^ , (72) 

ay + aw 
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Fig. 19 (a) Frequency distribution of Os/aw in standard model-like configurations, (b) Values of sin^O depending 
on aajo-w Each dot represents a class of models with these values. 



which depends on the ratio agjoLw We want to check the following relation between the three couplings, 
which was proposed in 1 19| and is supposed to hold for a large class of intersecting brane models 

+ (73, 
From this equation we can derive a relation for the weak mixing angle 



sin"^ = - / (74) 

The result is shown in figure [T9(b)l where we included a red line that represents the relation ( |73] ). The fact 
that actually 88% of all models obey this relation is a bit obscured by the plot, because each dot represents 
a class of models and small values for agjctw are highly preferred, as can be seen from figure [T9(a)l 



3 1 



3.3.4 Comparison with the statistics of Gepner models 

In this paragraph we would Uke to compare our results with the analysis of ll48l l49l . where a search for 
standard model-like features in Gepner model constructions ll64ll63l 12311271 has been performed. 

To do so, we have to take only a subset of the data analysed in the previous sections, since the authors 
of ll48l l49l restricted their analysis to a special subset of constructions. Due to the complexity of the 
problem they restricted their analysis to models with a maximum of three branes in the hidden sector and 
focussed on three-generation models only. Since the number of generations does not modify the frequency 
distributions and we obtained no explicit results for three generation models, we include models of an 
arbitrary number of generations in the analysis. To match the first constraint we filter our results and 
include only those models with a maximum of three hidden branes. But, as we will see, this does also not 
change the qualitative behaviour of the frequency distributions. 

In figure |20] we show the frequency distribution of the dimension of the hidden sector gauge group 
before (a) and after (b) the truncation to a maximum of three hidden branes. Obviously the number of 
models drops significantly, but the qualitative shape of the distribution remains the same. Figure [20(b)| can 
be compared directly with figure 5 of |09|. From a quahtative point of view both distributions are very 
similar, which could have been expected since the Gepner model construction is from a pure topological 
point of view quite similar to intersecting D-branes. A major difference can be observed in the absolute 
values of models analysed. In the Gepner case the authors of PP9l found a significantly larger amount of 
candidates for a standard model. 
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Fig. 20 Frequency distribution of the dimension of the hidden sector gauge group. Figure (a) is the full set of models, 
figure (b) shows the subset of solutions with a maximum of three branes in the hidden sector. 



Besides the frequency distribution of gauge groups we can also compare the analysis of the distribution 
of gauge couplings. In particular, the distribution of values for for sin^ depending on the ratio a^jois, 
figure [T9(b)] can be compared with figure 6 of B9l . We find, in contrast to the case of hidden sector gauge 
groups, very different distributions. While almost all of our models are distributed along one curve, in the 
Gepner case a much larger variety of values is possible. The fraction of models obeying ( |73]) was found to 
be only about 10% in the Gepner model case, which can be identified as a very thin line in figure 6 of ||49l . 
This discrepancy might be traced back to the observation that in contrast to the topological data of gauge 
groups we are dealing with geometrical aspects here. 

As explained in the last paragraph, the gauge couplings do depend explicitly on the geometric moduli. A 
major difference between the Gepner construction and our intersecting D-brane models lies in the different 
regimes of internal radius that can be assumed. In our approach we rely on the fact that we are in a 
perturbative regime, i.e. the compactification radius is much larger than the string length and the string 
coupling is small. 

3.4 Pati-Salam models 

As in the case of a S'C/(3) x SU{2) x U{1) gauge group, we can try to construct models with a gauge 
group of Pati-Salam type 

SU{4:) X SU{2)l X SU{2)r. (75) 

Analogous to the case of a standard model-like gauge group, we analysed the statistical data for Pati-Salam 
constructions, realised via the intersection of three stacks of branes. One brane with Na = 4 and two stacks 
with Ni,/c = 2, such that the chiral matter of the model can be realised as 

Ol- (4,2,1), Qfl = (4,l,2). (76) 

One possibility to obtain the standard model gauge group in this setup is given by breaking the SU{4:) 
into SU{3) X U{1) and one of the SU{2) groups into U{1) x U{1). This can be achieved by separating 
the four branes of stack a into two stacks consisting of three and one branes, respectively, and the two 
branes of stack 6 or c into two stacks consisting of one brane each. The separation corresponds to giving 
a vacuum expectation value to the fields in the adjoint representation of the gauge groups U{Na) and 
U (A^)f,/c, respectively. 

Models of this type have been constructed explicitly in the literature, see e.g ll38l l37l l35l [33l l34l |29l . 
However, one has to be careful comparing these models with our results, since our constraints are stronger 
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compared to those usually imposed. In particular, we do not allow for symmetric or antisymmetric rep- 
resentations of SU{4), a constraint that is not always fulfilled for the models that can be found in the 
references above. 



Log(# models) 



Fig. 21 Logarithmic plot of the number of Pati-Salam models found, depending on the number of generations. The 
solutions have been restricted to an equal number of left- and right-handed fermions, i.e. gen. — Ql ^ Qr 



A restriction on the possible models, similar to the standard model case, is provided by the constraint 
that there should be no additional antisymmetric matter and the number of chiral fermions transforming 
under SU (2)l and SU{2)ji should be equal. 

As can be seen in figure [21] we found models with up to eight generations, but no three-generation 
models. The conclusion is the same as in section [33] - the suppression of three generation models is 
extremely large and explicit models show up only at very large values of the complex structure parameters. 
The distribution differs from the standard model case in the domination of two-generation models. This is 
an interesting phenomenon, which can be traced back to the specific construction of the models using two 
N ^ 2 stacks of branes. This example shows that the number of generations, in contrast to the distribution 
of gauge groups in the hidden sector (see also section l376] l. does depend on the specific visible sector gauge 
group we chose. 



3.5 SU(5) models 

From a phenomenological point of view a very interesting class of low-energy models consist of those 
with a grand unified gauge groujH providing a framework for the unification of the strong and electro- 
weak forces. 

The minimal simple Lie group that could be used to achieve this is SU{5) |62| or also the so-called 
flipped SU{5) 11 11 143 1, consisting of the gauge group SU{5) x U{l)x- They represent the two possi- 
bilities how to embed an SU{5) gauge group into 5*0(10). The flipped construction is more interesting 
phenomenologically, because models based on this gauge group might survive the experimental limits on 
proton decay. Several explicit constructions of supersymmetric SU (5) models in the context of intersect- 
ing D-brane models are present in the literature Il36l l7l l27ll26l[30l[28j . as well as some non-supersymmetric 
ones HSIESI. 

In the remainder of this section we present some results on the distribution of the gauge group properties 
of SU{5) and flipped SU{5) models, using the same T^/Z2 x Z2 orientifold setting as in the previous 
sections. This part is based on ITTl . 



For an introduction see e.g. (931 or the corresponding chapters in I31II58I . 
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3.5.1 Construction 

In the original SU (5) construction, the standard model particles are embedded in a 5 and a 10 representa- 
tion of the unified gauge group as follows 

S'C/(5) SU{i) X SU{2) X U{1)y, 

5 ^ (3,l)2/3 + (l,2)_i, 
10 ^ (3,l)_4/3 + (3,2)i/3 + (l,l)2, (77) 
where the hypercharge is generated by the SU{'i) x SU (2)-invariant generator 

Z = diag(-l/3, -1/3, -1/3, 1/2, 1/2). (78) 
In the flipped SU (5) construction, the embedding is given by 
SU{5) X U{l)x SU{S) X SU{2) x U{1)y, 

5_3 ^ (3,l)_4/3 + (l,2)-l, 

lOi ^ (3,l)2/3 + (3,2)i/3 + (l,l)o, 

I5 ^ (1,1)2, (79) 
including a right-handed neutrino (1, l)o. The hypercharge is in this case given by the combination 

Y = -^Z +^X. (80) 
5 5 

We would like to realise models of both type within our orientifold setup. The SU (5) case is simpler, 
since in principle it requires only two branes, a U{5) brane a and a U{1) brane b, which intersect such that 
we get the 5 representation at the intersection. The 10 is realised as the antisymmetric representation of the 
U{5) brane. To get reasonable models, we have to require that the number of antisymmetric representations 
is equal to the number of 5 representations, 

lab = -#Anti„. (81) 

In a pure SU{5) model one should also include a restriction to configurations with #Sym^ = to 
exclude 15 representations from the beginning. Since it has been proven in ll36l that in this case no three 
generation models can be constructed and symmetric representations might also be interesting from a 
phenomenological point of view, we include these in our discussion. 

The flipped SU (5) case is a bit more involved since in addition to the constraints of the SU (5) case one 
has to make sure that the U{l)x stays massless and the 5 and 10 have the right charges, summarised in 
( l79t . To achieve this, at least one additional brane c is needed. Generically, the U{l)x can be constructed 
as a combination of all [/(l)s present in the model 

fc 

U{l)x ^Y.^-^^^^- (82) 

a=l 

The simplest way to construct a combination which gives the right charges would be 

C/(l)^-ic/(l),-^C/(l)fc + ^{/(l)e, (83) 



but a deeper analysis shows ||96ll , that this is in almost all cases not enough to ensure that the hypercharge 
remains massless. The condition for this can be formulated as 

fc 

J2xaNaYa^Q, (84) 



with the coefficients Xa from (I83I I. To fulfill this requirement we need generically one or more additional 
C/(l) factors. 
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Fig. 22 Logarithmic plot of the number of solutions with an SU (5) factor depending on the absolute value of the pa- 
rameters U. We give the results with (blue bars to the left) and without (red bars to the right) symmetric representations 
of S(/(5). 



3.5.2 General results 

Having specified the additional constraints, we use the techniques described in section l2.2.3l to generate as 
many solutions to the tadpole, supersymmetry and K-theory conditions as possible. The requirement of 
a specific set of branes to generate the SU{5) or flipped SU{5) simplifies the computation and gives us 
the possibility to explore a larger part of the moduli space as compared to the more general analysis we 
described above. 

Before doing an analysis of the gauge sector properties of the models under consideration, we would 
like to check if the number of solutions decreases exponentially for large values of the Uj, as we observed 
in section [J.2. II for the general solutions. In figure|22]the number of solutions with and without symmetric 
representations are shown. The scaling holds in our present case as well, although the result is a bit 
obscured by the much smaller statistics. In total we found 2590 solutions without restrictions on the 
number of generations and the presence of symmetric representations. Excluding these representations 
reduces the number of solutions to 914. Looking at the flipped SU{5) models, we found 2600 with and 
448 without symmetric representations. Demanding the absence of symmetric representations is obviously 
a much severer constraint in the flipped case. 

The correct number of generations turned out to be the strongest constraint on the statistics in our 
previous work on standard model constructions. The SU (5) case is not different in this aspect. In figure |23] 
we show the number of solutions for different numbers of generations. We did not find any solutions with 
three 5 and 10 representations. This situation is very similar to the one we encountered in our previous 
analysis of models with a standard model gauge group in section 13.31 An analysis of the models which 
have been explicitly constructed showed that they exist only for very large values of the complex structure 
parameters. The same is true in the present case. Because the number of models decreases rapidly for 
higher values of the parameters, we can draw the conclusion that these models are statistically heavily 
suppressed. 

Comparing the standard and the flipped SU (5) construction the result for models with one generation 
might be surprising, since there are more one generation models in the flipped than in the standard case. 
This is due to the fact that there are generically different possibilities to realise the additional U{l)x factor 
for one geometrical setup, which we counted as distinct models. 

Regarding the hidden sector, we found in total only four SU{5) models which did not have a hidden 
sector at all - one with 4, two with 8 and one with 16 generations. For the flipped SU (5) case such a model 
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Fig. 23 Plots of the number of solutions for different numbers of generations for (a) 5(7(5) and (b) flipped SU{5) 
models with (blue bars to the left) and without (red bars to the right) symmetric representations of SU (5). 
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Fig. 24 Logarithmic plots of the number of solutions with a specific rank M gauge factor in the hidden sector in 
(a) SU{5) and (b) flipped SU{5) models with (blue bars to the left) and without (red bars to the right) symmetric 
representations of SU{5). 



cannot exist, because it is not possible to solve the condition for a massless U{l)x without hidden sector 
gauge fields. 

The frequency distribution of properties of the hidden sector gauge group, the probability to find a 
gauge group of specific rank M and the distribution of the total rank, are shown in figures|24]and|25] The 
distribution for individual gauge factors is qualitatively very similar to the one obtained for all possible 
solutions above (see figures [13]). One remarkable difference between standard and flipped SU{5) models 
is the lower probability for higher rank gauge groups. This is due to the above mentioned necessity to have 
a sufficient number of hidden branes for the construction of a massless U{l)x- 

The total rank distribution for both, the standard and the flipped version, differs in one aspect from 
the one obtained in 13.2.11 namely in the large fraction of hidden sector groups with a total rank of 10 
or 9, respectively. This can be explained by just one specific construction which is possible for various 
values of the complex structure parameters. In this setup the hidden sector branes are all except one on top 
the orientifold planes on all three tori. If we exclude this specific feature of the SU{5) construction, the 
remaining distribution shows the behaviour estimated from the prior results. 
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Fig. 25 Plots of the number of solutions for given values of the total rank of the hidden sector gauge group in 
(a) SU (5) and (b) flipped SU (5) models with (blue bars to the left) and without (red bars to the right) symmetric 
representations of SU{5). 



Note that while comparing the distributions one has to take into account that the total rank of the hidden 
sector gauge group in the SU (5) case is lowered by the contribution from the visible sector branes to the 
tadpole cancellation conditions. In the flipped case, the additional C/(l)-brane contributes as well. 

3.5.3 Restriction to three branes in the hidden sector 

In order to compare our results for the statistics of constructions with a standard model-like gauge group 
with Gepner models in section [J. 3. 4! we truncated the full set of models to those with only three stacks of 
branes in the hidden sector. In the following we also perform a restriction to a maximum of three branes 
in the hidden sector in the SU (5) case, but with a different motivation and in a different way. We do 
not truncate our original results, but instead impose the constraint to a maximum of three branes from 
the very beginning in the computational process. It turns out that such a restriction can greatly improve 
the performance of the partition algorithm and allows us therefore to analyse a much bigger range of 
complex structures. This is highly desirable, since it opens up the possibility to check some claims about 
the growths of solutions that we made in section |23] The method has also some drawbacks. Since we 
do not compute the full distribution of models, but with an artificial cutoff, we can not be sure that the 
frequency distributions of properties in the gauge sector are the same as in the full set of models. As we 
will see in the following, there are indeed some deviations. 

In figure |26] we plotted the total number of models with a maximum of three stacks of branes in the 
hidden sector. As in our analysis above we show the models without symmetric representations separately. 
This plot should be compared with figure |22] the number of solutions for SU{5) models without restric- 
tions. In the restricted case we were able to compute up to much higher values of the complex structures 
and confirm the assertion of l2.3l that the number of solution drops exponentially with \U\. This provides 
another hint that the total number of solutions is indeed finite. In total we found 3275 solutions, which 
is more then in the case without restrictions, but in contrast to a range of complex structures which is 25 
times bigger, the amount of additional solutions is comparably small. 

Comparing the distributions for individual gauge factors (figure [27(a)| i and the total rank in the hidden 



sector (figure |27(b)[ ), we see some interesting differences to figures 24(a)| and |25(a)| The distribution of 



individual gauge factors is just extended to higher factors in the restricted case. This was to be expected, 
since larger values for the complex structure parameters allow for larger gauge factors to occur, since they 
provide us with very long branes with negative wrapping numbers X that can compensate these large 
numbers in the tadpole cancellation conditions. The general shape of the distribution remains unchanged. 
In the case of the total rank the situation is different. The distribution also shows larger values for the total 
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Fig. 26 Logaritiimic plot of the number of solutions with an SU (5) factor depending on the absolute value of the 
parameters U. The number of brane stacks in the hidden sector is restricted to three and the results are shown for 
models with (blue spikes) and without (red spikes) symmetric representations of 5(7(5). 




Fig. 27 Logarithmic plots of the frequency distributions in the hidden sector of SU (5) models with a maximum of 
three hidden branes. (a) Specific rank M gauge factors, (b) Total rank of the hidden sector gauge group. 



rank, which is directly correlated to the larger individual ranks of the factors, but moreover the maximum 
of the distribution is shifted from around seven in the unrestricted case to about four. This can be explained 
by the fact that the restriction to a maximum of three branes in the hidden sector also restricts the possible 
contributions from models with many gauge factors of small rank, especially the contribution of U{1) 
gauge factors. 

What about models with a flipped SU{5) gauge group? Repeating the analysis for these models in the 
case of a restriction in the hidden sector can of course be done, be the results might not be very predictive. 
For a consistent flipped SU{5) model, we need a massless U{l)x, which also depends on a combination 
of C/(l) factors from the hidden sector After choosing an additional U{1) brane for the visible sector of 
flipped SU (5) there remain only two hidden sector branes. This restriction is too drastic to give meaningful 
results, since it turned out in the analysis of flipped SU{5) models that we need more than two hidden sector 
branes to solve the equations for the U{l)x to be massless. 

The analysis in this section showed that three generation models with a minimal grand unified gauge 
group are heavily suppressed in this specific orientifold setup. This result was expected, since we know 
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that the expHcit construction of three generation SU{5) models using the Z2 x Z2 orbifold has turned out 
to be difficult. 

The analysis of the hidden sector showed that the frequency distributions of the total rank of the gauge 
group and of single gauge group factors are quite similar to the results for generic models in section U. 2. II 
Differences in the qualitative picture result from specific effects in the SU (5) construction. 

Comparing the results for the standard and flipped SU{5) models, we find no significant differences. If 
we allow for symmetric representations, there is basically no additional suppression factor If we restrict 
ourselves to models without these representations, flipped constructions are three times less likely then the 
standard ones. 

3.6 Correlations 

An interesting question that we raised in the introduction concerns the correlation of observables. If differ- 
ent properties of our models were correlated, independently of the specific visible gauge group, this would 
provide us with some information about the generic behaviour of this class of models. In the following 
discussion we would like to clarify this point, emphasizing a crucial difference between correlations of 
phenomenologically interesting observables in the gauge sector of our models on the one hand, and cor- 
relations between basic properties used as constraints to characterize a specific visible sector on the other 
hand. Finally we use the observations on the second class of correlations to estimate the number of mod- 
els with a standard model gauge group and three generations of quarks and leptons for the jTL^ x Z2 
orientifold. 

3.6. 1 Rank and chirality 

To give an example of correlations between gauge group observables let us consider the mean chirality 
X, defined by (|62] |. and the total rank of the gauge group. As we already saw using the saddle point 
approximation on T'* in section 13.1.31 these two quantities should be correlated. To confirm this in the 
four-dimensional case, we use our explicit results and compute the frequency distributions for the different 
visible sectors considered above, standard model-like constructions with and without a massless hyper- 
charge and Pati-Salam models. The result is shown in figure|28] Please note that we have normalised the 
distributions in order to make the results better comparable. 

We find two striking results here, which illustrate the two points we made in the introduction to this 
section. Firstly the two observables are clearly correlated, a large value for the mean chirality is much 
more likely to find if the total rank is small. Secondly the results for the full set of models, figure [28(a)] 
and the different visible sectors, figures |28(b)[ (c) and (d), show qualitatively very similar results. This 
last observation is intriguing, since we might use this to conjecture that the specific properties used to 
define an individual visible sector do not influence the distributions. Put differently, we might speculate 
that these properties could be regarded independent of each other. If this would be indeed the case, it could 
simplify some specific analysis dramatically. Instead of constructing solutions for one specific setup with 
some set of properties it would be enough to know the probabilities for each property. Since they would 
be independent of each other we could just multiply the results and get an answer to our more difficult 
question. 

3.6.2 Estimates 

We would like to test this conjecture using the properties of a standard model construction. These include 
several constraints on the models, in particular the existence of specific U (N) gauge factors, the vanishing 
of antisymmetric representations, a massless hypercharge and three generations of chiral matter. How can 
we check whether two of these properties A and B, are independent? A good measure for this would be 
to calculate the correlation between the probabilities P{A) and P{B) to find these properties. This can be 
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Fig. 28 Logarithmic plots of the relative frequency distributions of models with specific total rank of the gauge group 
and mean chirality. Plot (a) shows the analysis for the full gauge group of all models, figures (b), (c) and (d) give the 
results for the hidden sector gauge groups of standard model-like constructions with and without a massive hypercharge 
and Pati-Salam models, respectively. 



expressed as 



P{A)P{B)-P{AAB) 
P{A)P{B)+P{AaB) 



(85) 



where P{A A B) is the probability to find both properties realised at the same time. 

For concreteness let us take the following properties as examples: The existence of a U{3) gauge group, 
existence of a [/ (2) or Sp{2) gauge group and the vanishing of antisymmetric representations. In figure |29l 
we plotted the value of Pab in the set of all models for different values of the number of stacks. As can 
be derived from these plots the two properties are not really independent, but values of about 0.1 and 0.2, 
respectively, which are also the order of magnitude for other possible correlations, suggest that one could 
give it a try and treat these properties as independent in an estimat^ 

In table|5]we summarised the properties of a three-generation standard model, including the suppression 
factor calculated using the probability to find this property in the set of all models and their total number, 
1.66 X 10*. The two U{1) gauge groups required for a standard model setup are not included in this, since 
the probability to find a U{1) in one of the constructions is essentially one. Multiplying all these factors. 



Note that the independence of different properties have been an assumption that was used in the original work on vacuum 
statistics 1531. 
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(a) 



(b) 



Fig. 29 Correlations between properties of standard model-like configurations, (a) Correlation between the existence 
of an SU{3) and an 5(7(2) or Sp{2) gauge group, (b) Correlation between the existence of an SU{3) gauge group 
and the absence of symmetric representations. 



Restriction 


Factor 


gauge factor U (3) 


0.0816 


gauge factor C/(2)/S'p(2) 


0.992 


No symmetric representations 


0.839 


Massless U{1)y 


0.423 


Three generations of quarks 


2.92 X 10-^ 


Three generations of leptons 


1.62 X 10~3 


Total 


1.3 X 10-^ 



Table 5 Suppression factors for various constraints of standard model properties. 



we get a probability of « 1.3 x 10 , i.e. one in a billion, to find a three-generation standard model in the 
T^/ZzxZz setup. 

How reliable is this estimate? This is of course an important question, since we concluded from the 
analysis above that the basic properties are only approximately independent and we can not really make 
a quantitative statement about the possible error in our estimate. So let us compare the result we obtain 
with this method for models with standard model gauge group and two or four generations of quarks and 
leptons with the actual numerical results we have obtained in these cases. 

The result is shown in table|6] As can be read of this table, the estimate for the two- and four-generation 
case deviates by around 20% from the correct value. Keeping this in mind and further noting that we are 
making an estimate only at an order-of-magnitude level, a suppression factor of w 10^^ seems to be a 
reliable value. 



4 Conclusions and outlook 

In this work we have reviewed the results on the statistics of the gauge sector of a specific class of ori- 
entifold models. We have presented two different methods to derive these results. The saddle point ap- 
proximation flTl, working well in the eight- and six-dimensional case, is not powerful enough to deal with 



42 



F. Gmeiner: Gauge sector statistics of intersecting D-brane models 



# generations 


# of models found 


estimated # 


suppression factor 


2 


162921 


188908 


« 10-^ 


3 





0.2 


« 10-9 


4 


3898 


3310 


w 2 X 10"^ 



Table 6 Comparison between the estimated number of solutions and the actual number of solutions found for models 
with two, three and four generations. 



four-dimensional compactifications, forcing us to perform a more direct computer aided analysis ll69l . 
Using this approach, we discussed various aspects of frequency distributions in the gauge sector. After 
exploring the most general case, we focused on models containing phenomenologically interesting gauge 
groups. In the particular case of a standard model gauge group [68,|, an estimate of the number of three 
generation models in this setup was given, using the fact that the basic properties of such a model are 
sufficiently uncorrected. For this class of models we also analysed the values of the gauge couplings at the 
string scale. In the case of models containing a Pati-Salam or SU{5) gauge group |71 1, it was shown that 
the frequency distributions of gauge factors do not change, indicating that the specific choice of a visible 
sector does not alter the statistics. 

Concerning the universality of these results, it should be stressed that it is very likely that some of them 
depend strongly on the specific geometry that has been chosen. The comparison of some of our results with 
a study of Gepner models f4E\ confirms this conjecture, showing that only those results show similarities, 
which are fairly independent of the geometry. Especially the amount of three generation standard models 
might be very different if one chooses other orbifold groups. To illuminate this point, a study of models 
on T^/Zg is currently under way |70|. It would be of course very desirable to do similar studies in other 
corners of the landscape, in order to see whether the distribution of models resembling our four dimensional 
world is uniform or rather sharply peaked at special points in moduh space. 

Interesting directions of future studies would be on the one hand to include fluxes in our discussion, 
which probably give rise to a much larger set of models. On the other hand one could try to obtain frequency 
distributions of other important aspects of the (supersymmetric) standard model, such as Yukawa couplings 
or soft supersymmetry breaking terms. Including these in our discussion will certainly reduce the number 
of acceptable models and might also give additional hints which parts of the landscape are worth to be 
investigated in greater detail. 

A Orientifold models 

In this appendix we summarise the concrete examples of orientifold models that are used in this paper. We 
fix the notation and translate the conditions explained in general in section ITT] into variables that suit the 
specific cases and simplify the computations. 

A.l 

For compactification on T^, a special Lagrangian submanifold is specified by two wrapping numbers 
{na,rna) around the fundamental one-cycles. In this case these numbers are precisely identical to the 
numbers {Xa, Ya) used in section lzTl 

The tadpole cancellation condition ^ reads 

Y^NaXa = L, (86) 

a 

where the physical value is L = 16. 



43 



The first supersymmetry condition of d?) reads just 

Ya = 0, (87) 

and is independent of the complex structure U = R2/R1 on the rectangular torus. This implies that 
all supersymmetric branes must lie along the x-axis, i.e. on top of the orientifold plane. The second 
supersymmetry condition in (|7]l becomes 

Xa > 0. (88) 

From these conditions we can immediately deduce that if one does not allow for multiple wrapping, as it 
is usually done in this framework, there would only exist one supersymmetric brane, namely the one with 

(x,r) = (i,o). 

A.2 T^/Za 

In this case a class of special Lagrangian branes is given by so-called factorisable branes, which can be 
defined by two pairs of wrapping numbers (n^, rrii) on two T^s. The wrapping numbers (X*, ) with 
i = 1,2 for the Z2 invariant two-dimensional cycles are then given by 

— ni n2, X^ = mi m2, — ni ni2, Y^ = mi n^- (89) 

To simplify matters we sometimes use a vector notation X = {X^, X'^)'^ and Y — {X^ , X'^)'^ . 

Note that these branes do not wrap the most general homological class, for the 2-cycle wrapping num- 
bers satisfy the relation 

However, for a more general class we do not know how the special Lagrangians look like. Via brane 
recombination it is known that there exist flat directions in the D-brane moduli space, corresponding to 
branes wrapping non-flat special Lagrangians. Avoiding these complications, we use the well understood 
branes introduced above only. 

The untwisted tadpole cancellation conditions read 

^ Na Xl = L\ = -L\ (91) 

a a 

with the physical values Li = L2 ~ 8. In order to put these equations on the same footing, we change the 
sign of X2 to get 

J2 Na Xl = L\ - L\ (92) 



Note that in contrast to models discussed for example in 11651 . we are only considering bulk branes 
without any twisted sector contribution for simplicit}fl Defining the two form = {dxi+iUidyi){dx2-\- 
iU2dy2), the supersymmetry conditions become 

Ui Y^ + U2Y^ = 0, X^ + Ui U2 X^ > 0. (93) 

The intersection number between two bulk branes has an extra factor of two 

lab = -2 {Xl Xl + X^^ xl + Yl Y^ + Yl Y^) . (94) 



For a treatment of fractional branes in this framework see e.g. ||13II14|. 
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A. 2.1 Multiple wrapping 

In the case of it made no sense to restrict the analysis of supersymmetric branes to those which are 
not multiply wrapped around the torus, because there would have been just one possible construction. In 
the case of jTj^ the situation is different and we would like to derive the constraints on the wrapping 
numbers X and Y . 

For the original wrapping numbers n^, the constraint to forbid multiple wrapping is gcd(ni, to^) = 
1 V i = 1,2. Without losing information we can multiply these two to get 

gcd(ni,77ii)gcd(n2,m2) = 1. (95) 

Using the definitions ( |89] l of X and Y , we can rewrite this as 

gcd(x^y2)gcd(x^y2) - 1-2, (96) 

which is invariant under an exchange of X and Y . 



A.3 T^'/ZaxZa 

In the case of compactifications on this six-dimensional orientifold, which has been studied by many 
authors (see e.g. 1611 [38ll37ll80ll57l [161 ) the situation is very similar to the four-dimensional case above. We 
can describe factorisable branes by their wrapping numbers {ni,mi) along the basic one-cycles 7r2i-i, Tr2i 
of the three two-tori = li'^^^T^. To preserve the symmetry generated by the orientifold projection ila, 
only two different shapes of tori are possible, which can be parametrised by bi E {0, 1/2} and transform 
as 

na : I ^ . (97) 

For convenience we work with the combination 7r2i-i = T^2i-i — biTT2i and modified wrapping numbers 
rhi = rrii + hiUi. Furthermore we introduce a rescaling factor 

c:- |^n(l-6,)^ (98) 

to get integer-valued coefficients. These are explicitly given by (z, j^k E {1,2,3} cyclic) 

X'^ — cnin2ri3, X^ = —cnirhjfnk, 

Y'^ = cTOiTO2?7i3, y = —crhinjUk- (99) 

The wrapping numbers X and Y are not independent, but satisfy the following relations: 

XiYi = XjYj, Xl{YlY = XiXjXk, ^^^^^ 

XiXj = YkYl, YLiXL? = YiYjYk, 

for all /, J, X, L e {0, . . . 3} cyclic. 

Using these conventions the intersection numbers can be written as 

lab = ^ [XaYh - XbYa) . (101) 

The tadpole cancellation conditions read 
fc 

J2NaXa=L, L=\ I , (102) 
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where we used that the value of the physical orientifold charge is 8 in our conventions. 
The supersymmetry conditions can be written as 

5]^=0, ^X^t//>0, (103) 

7=0 ^ 7=0 

where we used that the complex structure moduli [// can be defined in terms of the radii {R\^\ R^p ) of 
the three tori as 

TT - 7?(2) n(3) 

U, = RfR':^'^Rf\ i, j,fc e {1,2, 3} cyclic. (104) 
Finally the K-theory constraints can be expressed as 

^iV„y^"e2Z, i_^^iv,rj e 2Z, {1,2,3}. (105) 

a=l a—1 



A. 3.1 Multiple wrapping 

We can define the condition to exclude multiple wrapping in a way similar to the T^'-case. A complication 
that arises is the possibility to have tilted tori. In the definition of X and Y in ( l99b we used the wrapping 
numbers m^, which have been defined to include the possible tilt. To analyse coprime wrapping numbers, 
however, we have to deal with the original wrapping numbers m^, such that 

3 

ngcdK,m,) = l. (106) 

We can express this condition in terms of the variables X and Y, defined as 

X° = nin2n3, Y° = mim2m-i, 

X^ = UiUjUk, Y^ = miUjUk, (107) 

where i,j,k £ {1,2,3} cyclic, analogous to section lA.2.11 



ngccl(f",l^) = (f°)^ (108) 

i=l 

The X and Y can be expressed in terms of the X and Y of ( |99] l. using their definition ( IIO6I 1 and the 



factor ^ 


ill, as 






X" = 








X' ^ 




+ hjY*' + 


bkY^ +bjbkX''), 










y° = 












1=1 


1=1 J 


Y' = 




-hx"). 





(109) 
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B Partition algorithm 

In this part of the appendix we briefly outline the partition algorithm used in the computer analysis of 
vacucfl It is designed to calculate the unordered partition of a natural number n, restricted to a maximal 
number of m factors, using only a subset F C N of allowed factors to appear in the partition. 

To describe the main idea, let us drop the additional constraints on the length and factors of the partition. 
They can be added easily to the algorithm, for details see the comments in listing |2] The result is stored 
in a list {a^}, which is initialized with ai = ?i(5i,i. An internal pointer q is set to the first element at the 
beginning and after each call of the main routine the list a contains the next partition. The length of this 
partition is stored in a variable to, which is set to m = 0, after the last partition has been generated. 

The main routine contains the following steps. It checks if the element Og is equal to 1 - if yes, it sets 
q = q — 1. This is repeated until Og > 1 or q = - in this case no new partitions exist, to is set to and 
the algorithm terminates. In the second step the routine sets Uq — ~ 1, a^+i ~ ciq+i + 1 and q = q + 1. 
But this operation is only performed if Uq^i < aq and aq > 1, otherwise the counter q is reduced by one 
and the algorithm starts over. 

Let us give an example to illustrate this procedure. Consider the unordered partitions of 5: 

{{5},{4,1},{3,2},{3,1,1},{2,2,1},{2,1,1,1},{1,1,1,1,1}}. (110) 

Starting with 5 itself, the first time we call the algorithm, it decreases ai to oi — 4, increases 02 to 02 = 1, 
which generates the partition {4,1}. The pointer q is increased to g = 2. The next time we call the routine, 
the element Uq = 02 is equal to 1, which leads to q — 1. Now the condition > 1 is satisfied and the 
result of aq — aq — 1, Og+i = ciq+i + 1 gives the partition {3, 2}. Continuing in this way, four more 
partitions of 5 are generated, until we reach {1,1,1,1,1}. We have = 1 for alH ~ 1, . . . , 5, which 
leads to the termination of the algorithm in the first step. 

B . 1 Implementation 

The algorithm uses a data structure part it i on to collect the necessary parameters and internal variables: 

typedef struct .partition { long n ,m, q , * fac , * a , min ; } partition; 

Here n e N is the number to be partitioned and m holds the length of the partition list a. The array fac 
contains the set F of allowed values of partition factors, min and q are internal variables to be explained 
below. Besides these internal variables, a global variable maxp is used, which contains the maximal length 
of the partition. 

The algorithm itself is split into two parts. The function apartitions_f irst is called once at the 
beginning of the program loop that runs through all partitions. It initializes the internal variables n and 
fac and calculates the minimum possible value for a partition factor from the list fac. Finally it checks 
if n itself is contained in fac and calls the main routine apart it ions_next if this is not the case. 

void a p a r t i t i o n s _ f i r s t ( long n, long *f, partition *p) { 
long i ; 

/* check if we're supposed to do anything */ 
if ((n>0)&&(maxp==0)) { 
p-^=0; 
return ; 

} 

/* find minimum and check consistency */ 
p— >min=n + 1 ; 

i=l; 

while (i<=n) { 
if (f[i]>0) { 



The complete program used to generate the solutions, which is written in C, can be obtained from the author upon request. 
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p— >min=i ; 
i=n+l ; 
} else { 
i ++; 

} 

} 

i f ( p->min>n ) { 
p->ni=0; 
return ; 

} 

/* init data structure */ 
p— >n=n ; 
p— >fac=f ; 

p— >a=malloc ((n + l)*sizeof(long)); 

p->a[0]=p->n; 

p— >m= 1 ; 

p— >a [l] = p— >n ; 

p->q=l; 

/* generate first partition (check if n is allowed...} */ 
if (f[n]<=0) { 

apartitions.next(p); 

} 

} 

Listing 1 Partition algorithm, initial routine 

The main routine can be called subsequently as long as the length m of the partition hst a is positive. 
Each call will produce a new partition of n. Special care has to be taken if elements of the partition are not 
contained in f ac - see the coimnents in the source code for these subtleties. 

void ap a r t i t i o n s _n e X t ( p ar t i t i o n *p) { 

/* set the number n what we have to distribute to 0. */ 
p->n=0; 

/* go back until there is a value bigger then the minimum min to distribute 
and the partition doesn't get too long. */ 

while ( ( p->q>=maxp ) | | ( ( p->q>0)&&(p->a [p->q]==p->min ) ) ) { 

p— >n=p— >n+p— >a [p— >q ] ; 

p->q=p->q-l; 

} 

/* loop through the distribution process as long as we're not back at the 
beginning of the factor list. */ 
while (p->q>0) { 

/* lower the actual value at q we're trying to distribute by 1 and add 1 to 
the distribution account, then increase the list —length m by one. */ 

p->a [ p->q ] = p->a [ p->q] - 1 ; 

p— >n=p->n + l; 

p— >nt=p— >q + l; 

/* as long as the new factor is > then the one before or it is not in 
fac , subtract 1 from it (and add 1 to n). do this as long as it is > 
then the minimum. */ 

while (((p->a[p->q]>p->a[p->q- 1])| |(p->fac [p->a [p->q]] <=0)) 
&&(p->a [ p->q] >=p->min ) ) { 
p->a [ p->q ] = p->a [ p->q ] - 1 ; 
p— >n=p— >n+l ; 

} 

/* check if the new factor is lower or equal then the one before and it 's 

in fac (the loop above might have terminated on the minimum condition ). 

if yes, add the distribution sum to the new factor at q + 1. if not, add the 
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whole factor at q to n and go one step hack in the list. */ 
if ((p->a[p->q]<=p->a[p->q-l])&&(p->fac [p->a[p->q]] >0)) { 
p->q=p->q + l; 
p->a[p->q]=p->n; 

/* // the new factor is < then the one before and in our list return. */ 
if ((p->a[p->q]<=p->a[p->q-l])&&(p->fac [p->a[p->q]] >0)) { 

return ; 
} else { 

/* so the new factor is not smaller or in our list — means we have to 
redistribute some of it to a new factor, but if we are already at the 
maximum length of the partition we have to go one step back! */ 
if (p— >q < maxhidden ) { 

p->n=0; 
} else { 

p->q=p->q - 1 ; 

} 

} 

} else { 

p— >n=p— >n+p— >a [p— >q ] ; 
p->q=p->q - 1 ; 

} 

} 

/* ('/ the pointer is q is there is nothing left to do — free memory and 
return for the length of the partition */ 
if (p->q <= 0) { 

f r e e ( p— >a ) ; p— >m= ; 

} 

} 

Listing 2 Partition algorithm, main routine 
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